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Abstract — Characterizing the fundamental tradeoffs for max- 
imizing energy efficiency (EE) versus spectrum efficiency (SE) 
is a key problem in wireless communication. In this paper, we 
address this problem for a point-to-point additive white Gaussian 
noise (AWGN) channel with the transmitter powered solely via 
energy harvesting from the environment. In addition, we assume 
a practical on-off transmitter model with non-ideal circuit power, 
i.e., when the transmitter is on, its consumed power is the sum 
of the transmit power and a constant circuit power. Under 
this setup, we study the optimal transmit power allocation to 
maximize the average throughput over a finite horizon, subject 
to the time-varying energy constraint and the non-ideal circuit 
power consumption. First, we consider the off-line optimization 
under the assumption that the energy arrival time and amount 
are a priori known at the transmitter. Although this problem 
is non-convex due to the non-ideal circuit power, we show an 
efficient optimal solution that in general corresponds to a two- 
phase transmission: the first phase with an EE-maximizing on-off 
power allocation, and the second phase with a SE-maximizing 
power allocation that is non-decreasing over time, thus revealing 
an interesting result that both the EE and SE optimizations 
are unified in an energy harvesting communication system. We 
then extend the optimal off-line algorithm to the case with 
multiple parallel AWGN channels, based on the principle of 
nested optimization. Finally, inspired by the off-line optimal 
solution, we propose a new online algorithm under the practical 
setup with only the past and present energy state information 
(ESI) known at the transmitter. 

Index Terms — Energy harvesting, power control, energy effi- 
ciency, spectrum efficiency, circuit power. 



I. Introduction 

GREEN or energy efficient wireless communication has 
recently drawn significant attention due to the growing 
concerns about the operator's cost as well as the global 
environmental cost of using fossil fuel based energy to power 
cellular infrastructures JT], ||2], ||3]. To achieve the optimal en- 
ergy usage efficiency for cellular networks, various innovative 
"green" techniques across different layers of communication 
protocol stacks have been proposed H, ||5]. Among others, 
how to maximize the bits-per- Joule energy efficiency (EE) for 
the point-to-point wireless Unk has received a great deal of 
interest recently ||6l, Q, ||8l. 

Besides maximizing EE, another key design objective in 
wireless communication is to maximize the spectrum efficiency 
(SE) or the number of transmitted bits-per-second-per-Hz 
(bps/Hz), due to the explosive growth of wireless devices and 
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Fig. I. Tradeoff between EE and SE for the ideal circuit power case of 
a = (the left sub-figure) and the non-ideal circuit power case of a > 
(the right sub-figure). 



applications that require high data rates. In order to design 
wireless communication systems both energy and spectrum 
efficiently, the fundamental EE-SE relationship needs to be ex- 
amined carefully. For the simple additive white Gaussian noise 
(AWGN) channel with bandwidth W and noise power spectral 
density Nq, by applying the Shannon's capacity formula, the 
SE and EE are expressed as ^se = log2(l + WW^) ^nd 
Cee = W^log2(l + )/P^ respectively, with P denoting 



the transmit power It thus follows that the optimal EE-SE 
tradeoff is characterized by ^ee = (2':sl-''i)jVo ' where ^ee 
is a monotonically decreasing function of ^se, as shown in 
the left sub-figure of Fig. [T] In this case, any SE increment 
will inevitably result in a decrement in EE. However, in 
practical wireless transmitters, besides the direct transmit 
power P, there also exists non-ideal circuit power consumed 
when P > 0, which accounts for the power consumptions 
at e.g. the AC/DC converter and the analog radio frequency 
(RF) amplifier, and amounts to a significant part of the total 
consumed power at the transmitter Moreover, when there is no 
data transmission, i.e., P = 0, the transmitter can turn into a 
micro-sleep mode ||9|, by switching off the power amplifier 
to reduce the circuit power consumption. For the ease of 
description, in this paper the transmitter status with P > and 
P = are referred to as the on and off modes, respectively. 
Denote the non-ideal circuit power during an "on" mode as 
a > in Watt, the efficiency of the RF chain as < 77 < 1, 
and the power consumed during an "off" mode as /3 > in 
Watt. A practical power consumption model for the wireless 
transmitter is given by ||T0| 
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(1) 



where Ptotai denotes the total power consumed at the trans- 
mitter In practice, /3 is generally much smaller as compared 
to a and thus can be ignored for simplicity |]5], lISl- In 
this paper, we assume /3 — 0. Therefore, in ([T) without loss 
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of generality we can further assume r/ ~ 1 since r/ is only a 
scaling constantQ With the above simplifications, the EE can 
be re-expressed as ^ee = T^log2(l + ww^)/{P + o;) for 
P > and the resulting new EE-SE tradeoff is shown in the 
right sub-figure of Fig. [T]for a given a > 0, from which it is 
observed that the non-ideal circuit power drastically changes 
the behavior of the EE-SE tradeoff as compared to the ideal 
case of a = 0. 

Recently, a new design paradigm for achieving green wire- 
less communication has drawn a great deal of attention, in 
which wireless terminals are powered primarily or even solely 
by harvesting the energy from environmental sources such as 
solar and wind, thereby reducing substantially the energy cost 
in traditional wireless systems ||4], ifTTI . With the embedded 
energy harvesting device and rechargeable battery, wireless 
transmitters can replenish energy from the environment with- 
out the need of replacing battery or drawing power from the 
main grid. Thus, communication utilizing energy harvesting 
nodes can promisingly achieve a jointly spectrum and en- 
ergy efficiency maximization goal. However, there are new 
challenges in designing energy harvesting powered wireless 
communication, which are not present in traditional systems. 
For example, the intermittent nature of most practical energy 
harvesting sources causes random power availability at the 
transmitter, due to which a new type of transmitter-side power 
constraint, namely energy harvesting constraint, is imposed, 
i.e., the energy accumulatively consumed up to any time 
cannot exceed that accumulatively harvested. As a result, 
existing EE-SE tradeoffs (cf. Fig. [U revealed for conventional 
wireless systems assuming a given constant power supply 
are not directly applicable to an energy harvesting system, 
with or without the non-ideal circuit power It is worth 
noting that some prior work in the literature has investigated 
the throughput-optimal power control policies for the energy 
harvesting wireless transmitter assuming an ideal circuit-power 
model (i.e., a — 0,1] — 1, and /3 = in in which useful 
structural properties of the optimal solution were obtained (see 
e g- lfT2l . ifTJl . lfT4l and references therein). However, there is 
very limited work on studying the effects of the non-ideal 
circuit power with a > on the throughput-optimal power 
allocation for energy harvesting communication systems. To 
our best knowledge, only I15J has proposed a calculus-based 
approach to address this problem; however, it does not reveal 
the structure of the optimal solution. Motivated by the known 
result that the non-ideal circuit power modifies the EE-SE 
tradeoff considerably in the conventional case with constant 
power supply as shown in Fig.[Tl we expect that it should also 
play an important role in the EE-SE tradeoff characterization 
under the new setup with random power supply due to energy 
harvesting, which motivates our work. 

In this paper, we study the throughput maximization prob- 
lem for a point-to-point AWGN channel with an energy 
harvesting powered transmitter over a finite horizon. For the 
purpose of exposition, we assume that the receiver has a 
constant power supply (e.g. battery). We also assume that at 

' Note that the results of this paper can be readily extended to the case with 
r; < 1 by appropriately scaling the obtained solutions. 



the transmitter, the renewable energy arrives at a discrete set 
of time instants with variable energy amount. Under this setup, 
we investigate the effects of the non-ideal circuit power with 
a > on the throughput-optimal power allocation as well as 
the resulting new EE-SE tradeoff. The main contributions of 
this paper are summarized as follows. 

• First, we consider the off-line optimization under the 
assumption that the energy arrival time and amount for 
harvesting are a priori known at the transmitter We 
show that the optimal power allocation to maximize 
the average throughput under this setup is a non-convex 
optimization problem, due to the non-ideal circuit power 
Nevertheless, we derive an efficient optimal solution for 
this problem, which is shown to correspond to a novel 
two-phase transmission structure: the first phase with an 
EE-maximizing on-off power allocation, and the second 
phase with a SE-maximizing power allocation that is non- 
decreasing over time. Thus, we reveal an interesting result 
that both the EE and SE optimizations are unified in an 
energy harvesting powered wireless system. 

• We then extend the optimal off-line policy for the single- 
channel case to the general case with multiple parallel 
AWGN channels, subject to a total energy harvesting 
power constraint. Using tools from nested optimization, 
we transform this problem with multi-dimensional (vec- 
tor) power optimization to an equivalent one with only 
one-dimensional (scalar) power optimization, which can 
then be efficiently solved by the algorithm derived for the 
single-channel case. 

> Furthermore, inspired by the off-line optimal solution, we 
propose a heuristic online algorithm under the practical 
setup where only the causal (past and present) energy 
state information (ESI) for harvesting is assumed to be 
known at the transmitter. It is shown by simulations 
that the proposed online algorithm achieves a small 
performance gap from the throughput upper bound by 
the optimal off-line solution, and also outperforms other 
heuristically designed online algorithms. 

The rest of this paper is organized as follows. Section 
nil introduces the system model and presents the problem 
formulation. Section [III] derives the optimal off-line power 
allocation policy for the single-channel case. Section |IV] 
extends the result to the multi-channel case based on the 
nested optimization. Section [V] presents the proposed online 
algorithm and Section |VT]evaluates its throughput performance 
by simulations. Finally, Section IVlIl concludes the paper 

II. System Model and Problem Formulation 

In this paper, we consider the point-to-point transmission 
over an AWGN channel with constant channel and coherent 
detection at the receiver The transmitter is assumed to re- 
plenish energy from an energy harvesting device that collects 
energy over time from a renewable source (e.g. solar or wind). 
We consider the block-based energy scheduling with each 
block spanning over T seconds (sees). We assume that the 
renewable energy arrives during each block at — 1 time 
instants given by < ti < ■ • • < t^^i < T, and the 
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energy values collected at these time instants are denoted 
by £'1, . . . , i?jv_i, respectively. In general, > 1, ti, and 
Ei > 0, i = 1, . . . , — 1, are modeled by an appropriate 
random process for the given energy source. For convenience, 
we assume to = and denote Eq as the initial energy stored in 
the energy storage device at the beginning time of each block. 
For the purpose of exposition, we assume that the energy 
storage device has an infinite capacity in this paper Moreover, 
we refer to the time interval between two consecutive energy 
arrivals as an epoch, and denote the length of the ith epoch 
as Li = ti — ti^i, i = 1, . . . , A^; for convenience, we denote 

Suppose that the transmit power over time in each block is 
denoted by P{t) > 0,t G (0,r]. Assume that the maximum 
transmission rate that can be reliably decoded at the receiver 
at any time t is a function of P{t), given by C{t) = R{P{t)), 
which satisfies the following properties: 

1) RiPit)) > 0, VP(t) > 0, and R{0) = 0; 

2) R{P{t)) is a strictly concave function over P{t) > 0; 

3) R{P{t)) is a monotonically increasing function over 
P{t) > 0. 

For example, if adaptive modulation and coding (AMC) is 
applied at the transmitter, then the achievable rate C{t) is 
denoted by lfT6l 

TWNoJ 

in bits-per-sec (bps), where F accounts for the gap from the 
channel capacity due to a practical coding and modulation 
scheme used; h > denotes the constant channel power gain. 

As discussed in Section |I] we assume an on-off transmitter 
power model given in ([U with (3 = and r/ = 1; thus, we 
rewrite ([T} as 

P{t) + a, P{t)>0 
0, P{t) = 0. 

Since the accumulatively consumed energy up to any time at 
the transmitter cannot exceed the energy accumulatively har- 
vested, the energy harvesting constraint on the total consumed 
power is given by 



RiPit)) = W\og^ 1 



total 



(3) 



/ Ptote.i{t)dt<y2Ej, 1 = 1,. ..,N. 
•^0 J=0 



(4) 



Thus, the throughput maximization problem over a finite 
horizon T can be formulated as follows. 

r-T 



max 

P(t)>0 



R{P{t))dt 







s.t. / PtotaiWdi< VSj, ^ = l,...,Af. (5) 
•^0 ,=0 
The optimal online solution for the above problem with the 
causal ESI, i.e., for any given t, only Ei's with ti < t sae 
known at the transmitter, can be numerically solved by the 
technique of dynamic programming similar to |[T3l . However, 
such a solution is of high computational complexity due to 
"the curse of dimensionality" for dynamic programming. In 
addition, the resulting solution will not provide any insight to 



the structure of the optimal power allocation for an energy 
harvesting transmitter Therefore, in this paper, we take an 
alternative approach by first solving the off-line optimization 
for (|5]l, assuming that all the energy arrival time i^'s and 
amount Ei's are a priori known at the transmitter in each 
block transmission, and then based on the structure of the 
off-Une optimal solution, devising online algorithms for the 
practical setup with only causal ESI known at the transmitter 
For the off-line optimization of (|5]l, it is easy to see that the 
objective function is concave; however, the constraint is non- 
convex in general since Ptotai(t) in (O is a concave function 
of P{t) if a > 0. As a result, the problem is in general 
non-convex and thus cannot be solved by standard convex 
optimization techniques. In the next section, we will propose 
an efficient solution for this problem by exploiting its special 
structure. 

Remark 2J: It is worth noting that for the off-line opti- 
mization, Q can be shown to be convex if a = 0. In this 
case, similar problems to (|5) have been studied in the literature 
|[T2l . lfT3l . in which the throughput-optimal power allocation 
P{t) was shown to follow a non-decreasing piecewise-constant 
(staircase) function over t. This power allocation can be 
interpreted as maximizing the SE of the point-to-point AWGN 
channel subject to the new energy harvesting power constraint. 
As will be shown later in this paper, the non-ideal circuit power 
with a > will change the optimal power allocation for this 
problem considerably. 

III. Off-Line Optimization 

In this section, we solve the off-line optimization problem 
in (|5j with the non-ideal circuit power, i.e., a > 0. 

A. Reformulated Problem 

First, we give the following lemmaH 

Lemma 3 J: During any ith epoch {ti-i,ti], i = 1, . . . , N, 
the optimal solution for (|5]i is given by P{t) = Pi > for 
the portion of time C (tj_i,tj], and P{t) = for the 
remaining time C {t,^i,t,], where 7^°" n 7^°* = (j) and 

Proof: See Appendix lAl ■ 
According to Lemma 13.11 and by denoting the constant 
transmit power Pi > for the "on" period with length 
< < Li in the ith epoch, (|5]) can be reformulated as 

N 



max Yl°''R{P,) 



s.t. P > 0, i = l,...,N 

0<ir<L„ i = l,...,N 



i-1 



^ (Pj + a)lf <^E„ i = \,...,N. (6) 

However, the above problem is still non-convex due to the 
coupling between P^'s and ^°"'s. In the following, we first 
solve this problem for the special case of = 1 and then 
generalize the solution to the case with A^ > 1. 

-We thank the anonymous reviewer who brought our attention to |15| , in 
which an alterative proof for Lemma lxTl is given based on a calculus approach. 
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B. Single-Epoch Case with N ^ 1 

In the single-epoch case with = 1, the problem in dSJ is 
reduced to 

max irRiPi) 

s.t. Pi > 

< Z?" < T 

Zr(Pi +a)< Eo. (7) 

The solution of the above problem is given in the following 
proposition. 

Proposition 3.1: The optimal solution and for (|7]l 
is expressed as 



= max ( Pee, ^ - a 



/on* 



En 



P* 



a 



where P^e is given by 



Pee = arg max — 

Pi>0 Pi 



(8) 



(9) 



(10) 



Proof: See Appendix iB] ■ 
It is worth noting that Pee given in ( fTOl i is the optimal 
power allocation that maximizes the EE of the AWGN channel 
under the non-ideal circuit power model as shown in Q. 
From Proposition 13.11 it follows that if Pee > — a, we 
have Pi* = Pee, IT* < T and if* = T - If* > 0, 
which corresponds to an on-off transmission. However, if 
Pee < -fi - a, we have P* = ^ ~ a, ^'j'"* = T and if* = 0, 
which corresponds to a continuous transmission. We will see in 
the next subsection that the EE-maximizing power allocation 
Pee plays an important role in the general case with > 1. 
Also note that the right-hand side (RHS) of (fTOl i is a quasi- 
concave function of Pi since it is concave-over- linear ifTTI : 
thus. Pee can be efficiently obtained by a simple bisection 
search ifTTll . 



C. Multi-Epoch Case with N > I 

Inspired by the solution for the single-epoch case, we derive 
the optimal solution for ^ in the general case with > 1, 
as given by the following theorem. 

Theorem 3.1: The optimal solution of (|6]l, denoted by 
[Pi*, ... , P;^] and [If*, /™*], is obtained as follows. De- 
noting 

iee,0 = 0, 



Y-vi— 1 






-1+1 ^fe 




-1 + 1,. 



-a< Pee, 



J = arff max i 

^ee — ^ee,J, 



the optimal transmit power for epochs 1, . . . , iee is given by 

P^ Pee, ^ 1, • • • , ^ee (12) 

and the optimal on-period lf*,i = l,...,iee, is any set of 
non-negative values satisfying 

iPee+a)J2'" ir-J2^"~'E, (13) 

{Pee + a) , If* < n E^, J ^ 1, ■ . . , lee- (14) 

Moreover, for epochs iee + 1, • . • , A^, the optimal solution is 
given by 

if* ^U, l^lee + l,...,N (15) 
P*^ ^"^"'-^ I -a, l^iee + l,.-.,N (16) 



X]*;=ni-i + l 



where 



rii = are- mm < ■. 

».-l+l<.<^\EL„.-i+liA 



,iv. 



(17) 



(11) 



Proof: See Appendix ICl ■ 
It is interesting to take note that the optimal transmission 
policy given in Theorem 13. II has a two-phase structure, which 
is explained as follows in more details. 

• < t < ti^^ : In the first phase, the optimal transmission 
is an on-off one with a constant power Pee for all the 
on-periods. Note that Pee is the EE-maximizing power 
allocation given in ( fTOl i. Also note that the optimal 
on-periods lf*,i = l,...,iee, may not be unique 
provided that they satisfy the conditions given in (fT3T l 
and (fT4l i. Without loss of generality, we assume that in 
each epoch, the transmitter chooses to be on initially 
with power Pee provided that its stored energy is not 
used up, i.e.. If* min(£:o/(-Pee + Q.),Li), If* = 
min {{El +Eo- {Pee + a)lf*)/{Pee + a), L2), and so 
on. 

• ^iee < t < T. In the second phase, a continuous trans- 
mission is optimal. Since If* ~ Li, i = iee + ^, ■ ■ ■ , N, 
the problem in (|6]l for i = iee + ^,---,N,is reduced to 

TV 

max E] LiR{Pi) 

^^^^ i=iee + l 

S.t. Pi> 0, i = iee + ■ ■ ■ ,N 

i i—1 i 

J2 L,P,<Y,E,- E L,a, 

i^lee + l,...,N. (18) 

The optimal solution for the above problem has been 
shown in lfT2l . lfT3l to follow a non-decreasing piecewise- 
constant (staircase) function, which is given in ( fT6] l. It is 
worth noting that the staircase power allocation achieves 
the maximum SE for an equivalent AWGN channel sub- 
ject to a sequence of energy harvesting power constraints 
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(modified to take into account the circuit power a) for 

i = iee + I, ■ ■ ■ , N. 

From the above discussion, it is revealed that for the 
throughput maximization in an energy harvesting transmission 
system subject to the non-ideal circuit power, the optimal 
transmission unifies both the EE and SE maximization policies 
independently developed in Q and lfT2l . lfT3l . respectively. To 
summarize, one algorithm for solving the problem in (|6) for 
the general case of > 1 is given in Table I. 

TABLE I 

Optimal Off-Line Policy for Single-Channel Case 



Algorithm 



1) Calculate Pee and obtain iee as 

iee,0 = 0, 

Ei — l 



-a< Pe, 



i = ieeJ^l+l,---,N 'fj >1, 



J = are maxi, 
2) For the first iee epochs, set 

P^ — Pee^ 2 = 1, . . . , 2e 
,on* / -^0 



1° 



.Pee + a 

Pee + a 



U ] ,i = 2 



, . . . , tee • 



3) If iee = N, algorithm ends; otherwise, set 

C* =Li,i = iee+l,...,N. 

4) Reset 

E'o 4^ 0,T' T - ^'"^ Lj,N' - iee, 



Lj <— Lj^i^^ , E- <— E. 



3 

■j+i 



5) Detennine 



«min = arg min 
j 



Pt 



j = 1, 

k=0 ^k 



and set transmit power as Pt in the next imin epochs. 
6) If 

^min — algorithm ends; otherwise, reset the parameters as 
follows, and go to 5). 

E'o ^ 0, T' T' - J2'"^° L'^,N' ^N' ~ i^i^, 



Remark 3.1: We discuss some implementation issues on 
energy harvesting communication systems with the optimal 
transmit power allocation given in Theorem 13.11 It is worth 
noting that in practical wireless systems, the duration of a 
communication block is usually on the order of millisecond, 
while the energy harvesting process evolves at a much slower 
speed, e.g., solar and wind power typically remains constant 
over windows of seconds. As a result, each epoch between any 
two consecutive energy arrivals in our model (during which 



the optimal power policy in Theorem 13.11 allocates a constant 
power) can be assumed to be sufficiently long, thus containing 
many communication blocks. In each communication block, 
pilot signals can be transmitted to help estimate the signal 
power at the receiver, which may change from one epoch to 
another due to transmit power adaptation; thus, the transmis- 
sion rate in (|2|i is practically achievable with AMC at the 
transmitter and coherent detection at the receiver 

Example 3.1: To illustrate the optimal two-phase transmis- 



sion given in Theorem 13.11 we consider an example of a 
band-limited AWGN channel with bandwidth W = IMHz 
and the noise power spectral density Nq = 10~^^Watts-per- 
Hz (W/Hz). We assume that the attenuation power loss from 
the transmitter to the receiver is h = — 80dB. Considering 
the channel capacity with F = 1, we thus have R{P) = 
P^log2(l+ rj^pV ) = log2(l + 100P)Mbps. It is assumed that 
the energy arrives at time instants [0, 4, 6, 11, 14, 16, 18]sec, 
and the corresponding energy values are [0.5, 0.5, 0.5, 1, 0.5, 

0. 75, 0.5]Joule, as shown in Fig|2] It is also assumed that 
T = 20secs and the circuit power is a =1 15.9mW. Under this 
setup, we compute Pee ~ 79.2mW. In Fig. [21 we compare the 
optimal allocation of the total consumed transmitter power by 
the algorithm in Table I with that obtained by the algorithm 
given in ini, Note that the algorithm in O or O 
solves (|6]l in the special case with the ideal circuit power 
a = 0. Here, we apply this algorithm to obtain a suboptimal 
power allocation with a > 0, by assuming that the transmitter 
is always on, i.e., ^™ = Lj , i = 1, . . . , N. As observed in Fig. 
I2a), the optimal power allocation has a two-phase structure, 

1. e., an on-off transmission with transmit power Pee when 
< t < is followed by a continuous transmission with non- 
decreasing staircase power allocation when < t < T, which 
is in accordance with Theorem 13.11 In contrast, as observed 
in Fig. I2b), the suboptimal power allocation by the algorithm 
in lfT2l or lfT3l with the transmitter always on results in a 
continuous transmission with non-decreasing staircase power 
allocation during the entire block i.e. < t < T. In addition, 
it can be shown that the proposed optimal solution achieves 
the total throughput 63.14Mbits, while the suboptimal solution 
achieves only 55.80Mbits, over T = 20secs. 

IV. Multi-Channel Optimization 

In this section, we extend the optimal off-line power al- 
location for the single-channel case to the general case with 
multiple parallel AWGN channels subject to a total energy 
harvesting power constraint and the non-ideal circuit power 
consumption at the transmitter The multi-channel setup is 
applicable when the communication channel is decomposable 
into orthogonal channels by joint transmitter and receiver sig- 
nal processing such as OFDM (orthogonal frequency division 
multiplexing) and/or MIMO (multiple-input multiple-output). 

Without loss of generality, we assume a power vector 
Q(i) = [Qi{t),...,QK{t)] hO,te (0,T], with each element 
denoting the power allocation over time in one of a total K 
parallel AWGN channels, where Q(t) ^0 denotes that Q{t) 
is elementwise no smaller than zero. We also assume a sum- 
throughput over the K channels, denoted by C{t) = R{Q{t)), 
which satisfies 
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Fig. 2. Power allocation by off-line policies: (a) the optimal off-line policy; 
and (b) the off-hne policy in 1121 . 1131 . 



1) i?(Q(i)) > 0,VQ(t) h 0, and i?(0) = 0; 

2) R{Q{t)) is a strictly joint concave function over Q{t) ^ 
0; 

3) i?(Q(t)) is a monotonically increasing function with 
respect to each argument in Q(t) i.e. Qk{t) > 0, fe = 
1,...,K. 

An example of the above multi-channel sum-throughput is the 
sum-rate over K parallel AWGN channels achieved by joint 
AMC, which is given by 

R{Q{t)) = WJ2 log2 f 1 + , (19) 



fc=i 



rwN, 



where hk > denotes the channel power gain of the fcth 
channel. Similar to (O in the single-channel case, by taking 
into account the non-ideal circuit power a, the total power 
consumed at the transmitter for the multi-channel case is 
modeled by 

gtotai(i) - { (20) 

fe=i 

Then the throughput maximization problem over a finite 
horizon T in the multi-channel case is formulated as 



max f R(Q(t))dt 
Q(*)bo Jo 



i-l 



•t. / gtotaiWdt< V-B,, ^ = 1,...,A^. (21) 



Similar to (|5]l, the above problem is non-convex with a > 
and thus cannot be solved by standard convex optimization 
techniques. In the following, we will apply the principle 
of nested optimization to convert this problem with multi- 
dimensional (vector) power optimization to an equivalent prob- 
lem with only one-dimensional (scalar) power optimization, 
which is then optimally solvable by the algorithm in Table I 
for the single-channel case. 

To apply the nested optimization, we first introduce an 

K 

auxiliary variable P{t) = E Qk{t), and rewrite the objective 



fc=i 



function of (l2Tl i equivalently as 



max 

P{t)>0 



: max 

P(t)>0 



Q{t):Qk{t)>oyk,j: Qk{t)<P{t) 

k = l 



max 

Q(t):Qfc(t)>0,Vfc, E Qk(t)<Pit) 

k=l 



R{Q{t))dt 
R{Q{t))dt. (22) 



Define the auxiliary function 

R{P{t)) m 



R{Q{t)) 



Q(t):Qfc(*)>o,Vfc, E Qk{t)<P{t) 

k=l 



(23) 



for which it can be easily verified that the maximum is attained 

K 

when E Qk{t) = P{t)- Thus, without loss of generality, we 

k=l 

can rewrite (l2rt equivalently as 



max / R(P(t))dt 
P{t)>o Jo 

S.t. / Ptotal(t)dt< 

Jo 



where 



-Ptotal(<) = 



P{i) + a, P{t) > 
0, P{t) = 0. 



, N (24) 



(25) 



Thus, the original problem with vector power optimization is 
converted by the nested optimization to an equivalent problem 
with only scalar power optimization. Thereby, we can first 
solve ( |24] | to get the optimal solution of P{t), and then with the 
obtained P{t) solve (l23T l to find the optimal solution of Q(t) 
for (l2n i. Since the problem in (l23T l is a convex optimization 
problem, it can be solved by standard techniques e.g. the 
Lagrange duality method ifTTl (in the special case of the sum- 
rate given in ( fT9] l, the optimal solution can be obtained by the 
well-known "water-filling" algorithm ifTTl ). 

In order to solve (iMt . we first give the following proposi- 
tion. 

Proposition 4.1: The function R{P{t)) satisfies the follow- 
ing properties: 

1) R{P{t)) > 0,VP(i) > 0, and i?(0) = 0; 

2) R{P{t)) is a strictly concave function over P{t) > 0; 

3) R{P{t)) is a monotonically increasing function over 
Pit) > 0. 

Proof: See Appendix Id] ■ 
Since R{P{t)) satisfies the same conditions as R{P{t)) for 
the single-channel case, it follows that ( |24] | can be similarly 
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solved by the algorithm in Table I, with one minor modifica- 
tion: the EE-maximizing power allocation in the multi-channel 
case needs to be obtained as 



Pee = argmax 



R{P) 



P>Q P + a 



max 



i?(Q) 



are: max ■ 

P>0 



Q:Qfc>o,vfe, y: Qk<P 

k = l 



P + a 



(26) 



Since the maximum in the above problem is attained by 

K 

J2 Qk = the optimal solution of Q is obtained as 

fc=i 



Q"^" = arg 



max 



^ (27) 

Q:Qk>o,vk,Y: Qk<P yOk + a 

h — 1 

k=l 



with Q^^ = [Qr,---,QKl and 



K 



Pee — ^ Qk 



(28) 



fc=l 



Since the RHS of jTTl is a quasi-concave function, this 
problem is quasi-convex and thus can be efficiently solved by 
the bisection method fTTl . Here we omit the detail for brevity. 

To summarize, the algorithm for solving (1211 for the multi- 
channel case is given in Table 11. 

TABLE II 

OPTrMAL Off-Line Policy for Multi-Channel Case 



Algorithm 



1) Obtain Pee by solving j27t and j28t : apply the algorithm in Table 
Uto obtain the solution P*(t) for l l24l . 

2) With the obtained P*{t), solve ^23) to obtain the solution Q*{t) 
for (ID. 



V. Online Algorithm 

In the previous two sections, we have studied the optimal 
off-line policies assuming the non-causal ESI at the transmit- 
ter, which provide the throughput upper bound for all online 
policies. In this section, we will address the practical online 
case with only the causal (past and present) ESI assumed to 
be known at the transmitter In particular, we will propose 
an online policy based on the structure of the optimal off- 
line policy obtained previously in Section HUl Due to the 
space limitation, we will only consider the single-channel case 
for the study of online algorithms, while similar results can 
be obtained for the general multi-channel case, based on the 
optimal off-line policy given in Section IIVI 

A. Proposed Online Algorithm 

For the purpose of exposition, we assume that the harvested 
energy is modeled by a compound Poisson process, where 
the number of energy arrivals over a horizon T follows a 
Poisson distribution with mean AgT and the energy amount in 
each arrival is independent and identically (i.i.d.) distributed 
with mean E. It is assumed that Ae and E are known at the 
transmitter. 



We propose an online power allocation algorithm based 
on the structure of the optimal off-line solution revealed in 
Theorem 13.11 Specifically, considering the start time of each 
block, from Theorem 13. II we obtain the closed-form solution 
for the optimal off-line power allocation at i = in the 
following proposition. 

Proposition 5.1: Suppose there are — 1 energy arrivals 
in (0,r) with iV > 1, the optimal off-line power allocation 
solution for (|6]l at t = is given by 



P*(0) = max 



min l ^kJoEk _^ 

i=l,....N 



Xk=iLk 



> ^ FA 



(29) 



Proof: See Appendix [E] ■ 
Note that in ( |29] |. Eq is available at the transmitter at t = 0, 
while A^, i;^, z = 1, . . . , A^ - 1, and Lj, z = 1, . . . , A^, are all 
unknown at the transmitter due to the causal ESI. As a result, 
we cannot compute P*(0) in ( |29l ) at i = for the online 
policy. Nevertheless, we can approximate the expression of 
P* (0) based on the statistical knowledge of the energy arrival 
process, i.e., Ae and E, as follows. 
Denote 



En 



En 



ELi Pk ELi ^k ELi ^k 



For any i < N, J2k=i -^k is the total energy harvested during 
(0,tj) and ^k = ti- We thus have 



J2k=l ^k ^etiE 



XeE, Vl< j < A^, 



where the approximation becomes exact when ti 

ELo Ek 



(30) 



Using ( [30b , we can approximate 



oo. 



XeE, and 



k=l ^k 



mm 

1=1 N 



mm 

= l,...,Ar 



SLo Ek 
Ek 

Eg 

,T,k=iEk 



XeE 



Eo 
T 



XeE 



and then obtain 



P*(0) 



Eo 
T 



XeE - a, Pee 



(31) 



Since Eq, Ae and E are all known at the transmitter at i = 0, 
OTI ) can be computed in real time. 

For any < f < T, by denoting the stored energy as 
Es{t) with Es{0) = Eq, we can view the online throughput 
maximization at time t over the remaining time T — t to have 
an initial stored energy Eq = Es{t). Therefore, by replacing 
T and Eq in (l3Tl i as T ~ t and Es{t), respectively, we obtain 
the following online transmit power allocation policy: 



P 



online 



T-t 



XeE - a, Pe, 



0, 



Es{t) > 
Esit)^0 
(32) 



for any t e [0,T). 
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Fig. 3. Power allocation by the proposed online policy. 



The online policy in ( |32] ) provides some useful insights. 



Note that 



Es{t) 



1 ^^e^ OL can be viewed as the "expected" 
available transmit power for the remaining time in each block. 



which can be negative for some t if 



Es{t) 
T-t 



KE < 



Thus, if this value is less than the EE-maximizing power 
allocation Pee, the transmitter should transmit with Pee to save 
energy; however, if the inequality is reversed, the transmitter 
should transmit more power to maximize the SE. Moreover, 
as compared to the optimal off-line power allocation for the 
single-epoch case given in Proposition 13.11 we see that the 
online policy ( |32] | bears a similar structure, by noting that 
Eo/T — a in ^ for the single-epoch case is also the available 
transmit power for the remaining time in each block. Last, it 
is worth remarking that the online power allocation policy in 
(|32] | is expressed as a function of the continuous time for 
convenience; however, in practice, this policy needs to be 
implemented in discrete time steps by properly quantizing the 
continuous-time function. The time step needs to be carefully 
chosen in implementation: On one hand, it is desirable to use 
smaller step values to achieve higher quantization accuracy 
for energy saving, while on the other hand, the time step 
needs to be sufficiently large, i.e., at least larger than one 
communication block (cf. Remark 13.1b so that the receiver 
can have a timely estimate of any transmit power adjustment. 

Example 5.1: To illustrate the proposed online power allo- 
cation policy in ( |32] |. we consider the same channel setup and 
harvested energy process for the off-line case in Example 13. II 
(cf. Fig. |2]l. In Fig. [3] we show the total transmitter power 
consumption by the proposed online policy assuming that 
the exact average harvested power XeE = {^f^^ Ei)/T = 
187. 5mW is known at the transmitter It is observed that the 
online power allocation is no more piecewise-constant like the 
optimal off-line power allocation in Fig. EJa). Nevertheless, 
it is also observed that these two policies result in some 
similar power allocation patterns, i.e., starting with an on- 
off power allocation followed by a non-decreasing (in the 
sense of average power profile for the online policy case) 
power allocation. This suggests that the proposed online policy 
captures the essential features of the optimal off-line policy. 
Moreover, it can be shown that the proposed online policy 
achieves the total throughput 61.61Mbits over T = 20secs, 
which is only 1.53Mbits from 63.14Mbits of the optimal 
off-Une policy. In addition, it can be verified that the total 



- optimal offline policy 

- proposed online policy 

- EEP 

- EIMP 




Fig. 4. Average throughput versus the energy arrival rate \e with E ■■ 
and T =20secs. 



: 0.5J 



throughput obtained by the proposed online policy is very 
robust to the assumed average harvested power value XeE. 
For example, by setting XeE to be 150mW or 200m W, the 
proposed online policy obtains the throughput 61.38Mbits and 
61.60Mbits, respectively, which is a very small loss in either 



VI. Simulation Results 

In the section, we compare the performance of the proposed 
online policy with the performance upper bound achieved by 
the optimal off-line policy under a stochastic energy harvesting 
setup modeled by the compound Poisson process. The amount 
of energy in each energy arrival is assumed to be independent 
and uniformly distributed between and 2E. For the purpose 
of comparison, we also consider two alternative heuristically 
designed online power allocation policies given as follows. 
• Energy Efficient Policy (EEP): In this onUne poUcy, 
the transmitter transmits with the EE-maximizing power 
allocation Pee given in ( fTOl i provided that there is a non- 
zero stored energy, i.e.. 



Peep(<) 



0, 



Es{t) > 
E,{t)^0 



(33) 



for any t G [0,T). 

Energy Neutralization Policy (ENP): This online poUcy 
transmits with a constant power that satisfies the long- 
term energy consumption constraint if there is available 
stored energy, i.e.. 



Pi 



ENP 



XeE — a, 
0, 



Esit) > 
Esit) = 



(34) 



for any t G [0,P). Note that in the above we have 

assumed that XeE > a. 
First, we consider a single AWGN channel in Figs. |4] and 
[5] with the same channel parameters as for Example 13.11 In 
Fig. m we show the average throughput over T = 20secs 
versus A,., with E = O.SJoule (J). It is observed that when Ae 
is small, the proposed online policy and EEP obtain similar 
performance as the optimal off-line policy. Since the average 
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- optimal offline policy 

- proposed online policy 

- EEP 
■ ENP 



f f f 1 ^ 



Fig. 5. Average throughput versus the block duration T with E =0.5J. 




optimal offline policy 
proposed online policy 
EEP 
ENP 



X (sec 



Fig. 6. Average throughput in a single-cell OFDMA downlink system with 
renewable powered BS versus the energy arrival rate Ae with E = 200J and 
T =20secs. 



harvested energy is small when Ae is small, it is more likely 



that Pee is greater than both 



XeE - a (c.f. (Eali) 



and 



„. , a, Vi 1, . . . , iV (c.f. ^). Thus, both the 
proposed online and off-line policies choose to transmit with 
Pee during the "on" periods to save energy. However, for 
ENP, the transmit power level deviates from Pee and thus 
significant amount of energy is consumed due to the non- 
ideal circuit power; as a result, the achievable throughput is 
almost zero. As Ae increases, the throughput gap between the 
optimal off-line policy and all online policies enlarges, and 
the performance of EEP degenerates severely. Moreover, ENP 
is observed to obtain a similar performance as the proposed 

E it) 

online policy. This is because in this case, rf_l is negligibly 
small as compared with XeE, and as a result the proposed 
online policy degenerates to ENP. 

Fig. |5] shows the average throughput versus T, for two 
different values of Ae = 0.3/sec and 1/sec, with E = 0.5J. 
In both cases of Ae, the proposed online policy is observed to 
perform close to the optimal off-line policy, for all values of 
T. With small value of Ae, i.e., Ae = 0.3/sec, EEP performs 
much better than ENP since it is more energy efficient, while 
with a larger value of Ae, i.e., Ae = 1/sec, the reverse becomes 



true, which can be similarly explained as for Fig. E] 

Furthermore, for the multi-channel scenario, in Fig. 0we 
evaluate the average throughput of a single-cell downlink sys- 
tem with the base station (BS) powered by energy harvesting. 
It is assumed that the BS covers a circular area with radius 
1000 meters, and serves K users whose locations are gener- 
ated following a spatial homogeneous Poisson point process 
(HPPP) with density 10~^ users/m^. Consider a simplified 
channel model without fading, in which the channel power 
gain of each user from the BS is determined by a pathloss 

model Co , where cq = — 60dB is a constant equal to 

the pathloss at a reference distance do = 10m, and C = 3 
is the pathloss exponent. Assuming an OFDMA (orthogonal 
frequency division multiplexing access) based user multiple 
access, a total bandwidth W = 5MHz is equally allocated to 
the K users. The noise power spectral density at each user 
receiver is set as A'o = — 174dBm/Hz and F = 1 is assumed. 
The circuit power at the BS is set as a = 60Watt. Fig.|6]shows 
the average throughput versus Ae with E = 200J and an energy 
scheduling period T =20secs. It is observed that the proposed 
online policy always performs better than EEP. With small 
values of Ae, i.e., Ae < 0.8/sec, ENP preforms clearly worse 
than the proposed online algorithm, while when Ae > 0.95/sec, 
both schemes perform similarly. This observation is expected 
as can be similarly explained for Fig. |4] 

VII. Concluding Remarks 

In this paper, we studied the throughput-optimal transmis- 
sion policies for energy harvesting wireless transmitters with 
the non-ideal circuit power. We first obtained the optimal 
off-line solution in the single-channel case, which is shown 
to have a new two-phase transmission structure by unifying 
existing results on separately maximizing energy efficiency 
and spectrum efficiency. We then extended the optimal off- 
line solution to the general case with multiple AWGN channels 
subject to a total energy harvesting power constraint, by the 
technique of nested optimization. Finally, we proposed an 
online algorithm based on a closed-form off-line solution. It 
is shown by simulations that the proposed online algorithm 
has a very close performance to the upper bound achieved 
by the optimal off-line solution, and also outperforms other 
heuristically designed online algorithms. 

After submission of this manuscript, we become aware of 
one interesting related work [19] that is worth mentioning. In 
||T9l, the throughput optimization in a single-channel energy 
harvesting communication system with battery leakage is 
introduced. The impact of battery leakage is very similar to 
that of the non-ideal circuit power considered in this paper; 
as a result, the optimal transmission policy developed in llT9l 
is similar to the one proposed in Section |lll] of this paper. 
One difference is that the optimal off-line policy in |fT9l is 
developed by decoupling the general multi-epoch problem 
into multiple equivalent single-epoch subproblems, while the 
same optimal policy in this paper is derived by decoupling 
the problem into only two subproblems (c.f. Appendix |C]i- 
Compared with the solution in |[T9l . the off-line policy in this 
paper reveals the optimal two-phase structure in which EE and 
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SE optimizations are unified, and thus motivates our online 
policies, which are not given in |fT9l . 



Appendix A 
Proof of LemmaUTT] 



Denote the length of the on-period 7^°" as and that of 
the off-period as if, where /™ + if = Li. Without 
loss of generality, we only need to consider the case with on- 
period 7;™ = {k-i,h_i+lf] and off-period = (t,_i + 
If, ti], since exchanging the power allocation at two different 
time instants in an epoch does not change the throughput and 
the energy constraint. Next, we prove that the transmit power 
should be constant during the on-period in an epoch by 
contradiction. 

Suppose that the optimal allocated transmit power P{t), 
where P{i) > 0,t G {ti-i,ti-i + If is not constant. Since 
R{P{t)) is a strictly concave function, based on Jensen's 
inequality, we have 



R 



> 



ti-i+i° 



Rim) 

ion 



dt, (35) 



and then 



--irn 



/on 



1° 



> I R{P{t))dt. 



(36) 



Thus, if we construct a new transmit power allocation P{t) as 

P{t) = P,^ ;on >0,te {U-l,U.l + If], we 

can achieve a larger throughput than that achieved by P{t). 
Moreover, we verify that P{t) consumes the same total energy 
as P{t) in the ith epoch, i.e.. 



P{t)+a] dt = lf{P,+a) 



P{t)dt + lfa= / {P{t)+a)dt. 

U-i 

(37) 

Therefore, based on ( |36] l and ( |37t . we conclude that P{t) 
cannot be optimal and thus Lemma 13.11 is proved. 



Appendix B 
Proof of Proposition I3.1I 

To solve (|7]l, we note that the third inequality constraint 
must be met with equality by the optimal solution, since other- 
wise the throughput can be further improved by increasing Pi . 
Thus, by substituting /J" ~ p^+a ^^^^ objective function 



as well as the constraint If < T, the problem becomes 
equivalent to finding 



Pi = arg 
= arg 



Pi>0,Pi>Eo/T-a Pi + a 

RiPl) 



R{Pi 



max 



Pi>0,Pi>Eo/T-a Pi + a 



(38) 



Consider first the following problem with the relaxed power 
constraint: 

n,ax^^. (39) 

Pi >o Pi + a 

This problem has been studied in Q, where the globally opti- 
mal solution is known as the EE-maximizing power allocation, 
denoted by P^e- It was also shown in ||7l that given a > 0, 
is monotonically increasing with Pi if < Pi < Pee, 
and monotonically decreasing with Pi if Pi > P^e- Thus, the 
solution of (|38]| is obtained as 



Pi = max ( Pee , — - a 



Accordingly, the optimal on-period is given by 



70n* 

*1 - ^ 



Pi* + a 



(40) 



(41) 



Proposition 13. II is thus proved. 



Appendix C 
Proof of Theorem|3.1| 



To prove Theorem 13. II we construct the following two sub- 
problems Pi and P2 for the power allocation optimization in 
the first iee epochs and the last N — Jee epochs, respectively. 



S.t. < If <U,i = l,...,iee, 



E 



{P,+a)lf <^E,,i = \,...,i, 

j=0 



(42) 



'■ max 



N 



ifRiK 

i = iee + l 

S.t. < If < L,,i^i,e + l,--.,N 

E (P,+a)lf< 

i = i,e + l,...,N. (43) 



We will first prove that the solution given in Theorem 13.11 is 
optimal for both Pi and P2, and then prove that the optimal 
solutions for Pi and P2 are also optimal for (|6). 

First, we prove that the solution given in (fTZt . (fT?i and (fT4] i 
is optimal for Pi. 

Consider the throughput maximization problem Pi, with 
the arrived energy Eq, . . . , -Ei^-i at time to, ... , ti^^-i over 
the horizon Ti = X]a:=i Ek- We construct an auxiliary 
throughput maximization problem Pi with the energy arrival 
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X]fc"o ^fe7 0, ■ • ■ , at time to, ... , ii^^-i over the same hori- 
zon Ti as follows. 



: max VZ°"i?(P,), 

< Li,i ^ 1,. 



1=1 
s.t. < 



< 5^ E,. (44) 

It is clear that the optimal throughput of Pi is an upper bound 
on that of Pi since any feasible solution of Pi is also feasible 
for Pi. Note that there is no energy arrived in ti, . . . , ti„-i for 
Pi, so IPi is indeed equivalent to a throughput maximization 
problem for the single-epoch case studied in Section III- 
B over a horizon Ti. It can be easily verified based on 

O that =^ - a < Pee-, thus, it follows after 

some simple manipulation that the optimal value of Pi is 

yi„-i ^ 

=^ --RiPee) and is attained by Pi* = ■ • ■ = f* = Pee 

Pee + a 

and 

/on* ^ £^±± Meanwhile, for Pi we can always 

1^1^13 I 

construct a feasible solution based on (fTSl i. ( fT3T l and ( fT4l i by 
setting 



Pee 7 • ■ ■ ; ^eei 

Z— //C — lee,j-l 



.J, 



Pee 



E 

A; — ^ee,j — 1 



ife, Y7 = l,, 



(45) 

It can be verified from (fTTt that the solution satisfying (l45t is 
feasible for Pi, and attains an objective value of — ■ 

_Pee + a 

R{Pee), which is the same as the optimal value of Pi. The gap 
between Pi and Pi is thus zero, and accordingly, the solution 
given in (fTZt . (fTsT l and (fT4] i is optimal for Pi. 

Second, we prove that the solution in ( fTSl l. ( fTSI l and ([TtT i is 
optimal for Problem P2 over the last N — i^e epochs. First, 
we show ~ Li,i = iee + 1, • • • , by contradiction as 
follows. Suppose that the optimal solution P{t) contains an 
"off" period with + Ai°^) C (tj„,T], i.e., P{t) = 

G + At°ff). 

Due to the definition of ( fTTT i. it follows immediately that 



- a > Pp,p,,yi > «e 



(46) 



Therefore, we can always find a time duration with P(<) = 
P™ > Pee,t e (<°",r" + Af°") C and construct a 

new policy P(t) with P(t) P°" ^^^^^ a, t G 

+ At°") U + 6). Note that we have chosen 

S to be sufficiently small so that + S) C + 

Ai°^) and P°'^ > P^e- The energy consumed by P{t) during 
(t°",£°" + At°")U(£°^,£°^+J) is P = (P™ + a)(At™+5) = 



(P°" + a)At°", which is same as the initial policy P{t). The 
throughput for the newly constructed policy P{t) and initial 
policy P{t) during + At°") U , + S) are 



Pi = P(P™)(Ai™ + 5) = P(P™) 



P 



pon _|„ Q, 



and 



B2 = p(p°")a£™ = p(p™) 



p 



po 



respectively. Since P°" > P°" > Pee, and is mono- 

tonically decreasing as a function of x when a; > Pee, we 
conclude that Pi > P2. Therefore, the new policy achieves a 
higher throughput than the initial policy. 

Moreover, we need to check that the new policy also 
satisfies the energy constraint as follows. If > t°" + At°", 
or + Af°") and , + S) are in the same epoch, 

it is evident that the energy constraint is satisfied. If we 
cannot find an interval + Ai°^) latter than or in 

the same epoch as + At""^), i.e., all the allocated 

power prior to is smaller than Pee, in this case based 
on (|46] |, there must be some energy left at time . Since 
we selected S to be sufficiently small, we can still guarantee 
that the energy constraint is satisfied. Therefore, we prove that 

=P,,^ = V,e + l,...,iV. 

After determining = P;, i = iee + 1, ■ • • , -/V, we realize 
that Problem P2 for the last N — i^e epochs has the same 
structure as that studied in |[T2i|, and thus lfT2l Theorem 1] is 
applicable here. After some change of notation, we can prove 
that the solution in ( fTSl ), ( fTSl l and ( [Tt] ) is optimal for P2 over 
the last N — i^e epochs. 

Next, we show that the optimal solutions of Pi and P2 are 
optimal for (|6]l to complete the proof for Theorem 13.11 First, 
we can verify by following the similar contradiction proof in 
the above that any energy harvested during [0,ii^^) should 

Zee 

be used up in the first iee epochs, i.e., ^ (P* + a)l°^* = 

iee-l 

^ Ej . Therefore, the energy constraint for (^ is equivalent 
to 



1 ■ • ■ ; ^ee 



(47) 



E (P,+a)C< E ^J:* = *ee + l,...,iV. 



J=»ee+1 



J=»e 



Since the energy constraint is decoupled before and after , 
solving ^ is equivalent to optimizing Pi and over i = 
1, . . . , iee and i = iee + 1, ■ • . , separately. Thus, the optimal 
solutions for Pi and P2 are also optimal for (|6). Theorem 13.1 
is thus proved. 

Appendix D 
Proof of PropositionI4.1I 



The first and third properties of R{P{t)) can be directly ver- 
ified by the first and third properties of P(Q(i)), respectively. 
Thus, to complete the proof of Proposition 14.11 we only need 
to show the second property of P(P(<)), i.e., it is a strictly 



12 



concave function of P{t). Similar to IfTSl Appendix B], we 
show the proof of this result as follows. 

Since R{P{t)) is obtained as the optimal value of ( l23T l, 
which is a convex optimization problem and satisfies the 
Slater's condition IITtI . Thus, the duality gap for this problem 
is zero. As a result, R{P{t)) can be equivalently obtained 
as the optimal value of the following min-max optimization 
problem: 



R(P) = min max i?(Q) - u V 

K 



Q 



4 k 



P 



k=l 



(48) 



(49) 



K 



n'-^^P (50) 



k=l 



where we have removed t for brevity, and in (|49] l Q'-^-' = 
[Q['^\ . . . , Q^''] is the optimal solution for the maximization 
problem with a given /i, while in ( fSOl l /x'^^ is the optimal 
solution for the minimization problem with a given P. Since 
R{Q{t)) is a strictly joint concave function, the optimal 
solutions in the above must be unique. Denote u as any 
constant in [0,1]. Let ^'■^^^ and /i'^^^' be the optimal 

11 for i?(Pi), R{P2) and R{P3.) with P3 = wPi + (1 - uj)P2, 
respectively. For j = 1, 2, we have 

fe=i 

where strict inequality holds for ( |52] i if P, 7^ P3 since the 
optimal = 1,2, are unique. Thus, we have 



R{Pj) = P(Q 



< P(Q^ 



(^^'P,- (51) 



/i(^^)p,- (52) 



wP(Pl) + (l-Cj)P(P2) 



fc=l 



/^^)P3 



=P(a;Pi + (1 -a;)P2), 



(53) 

(54) 

(55) 
(56) 



where strict inequality holds for (|54] | if a; G (0,1). Therefore, 
R{P{t)) is a strictly concave function over P{t) > 0. The 
proof of Proposition 14.11 is thus completed. 



Appendix E 
Proof of Proposition s. II 

We prove this proposition by considering the following two 
cases. 

First, consider the case when i^e exists, i.e., i^e G 
{1, . . . , N}. In this case, setting Pee as the transmit power 
at time is optimal according to Table |T] Furthermore, we 

have j"^ ^ ~ a < Pee based on ( fill ), and thus 



Thus, (|29T l is equivalent to P*(0) = Pee, which is the optimal 
solution. 

Next, consider the case when iee does not exist, which 
implies that ^']=" f " - a > Pee,Vj = 1, . . . ,iV. Thus, we 
have 



> P.. 



min ( ^1=°^" - a), 

i=l,...,Ar Z.fc=i^fc 



and (|29l) is equivalent to P*(0) 

which is the optimal solution according to Table |I] 

From the above two cases, ( |29] l coincides with the optimal 
solution for P*(0). Thus, Proposition 15. II is proved. 
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